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INFLUENZA STUDIES. 

By Raymond Pearl, Ph. D., D. Sc., LL. D., Professor of Biometry and Vita! Statistics, School of 
Hygiene and Public Health, Johns Hopkins University; Consultant in Vital Statist ics and Epidem- 
iology, United States Public Health Service. 

II. FURTHER DATA ON THE CORRELATION OF EXPLOSIVENESS OF 
OUTBREAK OF THE 1918 EPIDEMIC 

I. Introduction. 

In the first of these Studies 2 it was shown that there was a definite 
and sensible net con-elation between explosivehess of outbreak of 
the epidemic, as measured by an epidemicity index, and the normal 
death rate from certain organic and chronic diseases. Because of 
the importance of the subject it has been thought desirable to 
reexamine critically the data, making use of more refined quanti- 
tative measures of the several variables dealt with. It is the object 
of the present paper to give the results of this re-study of the problem. 
As before, the basic data are from the large American cities for 
which weekly data were furnished during the epidemic, by the 
Bureau of the Census. 5 There is now in progress in this laboratory 
an extended study of the same problems on the basis of data from 
the 96 great towns of England and Wales, as well as further studies 
oh the American data. 

Before taking up the detailed matters of the present study, I 
should like to call attention briefly to some methodological con- 
siderations which lie at the foundation of this and other papers in 
this series. It is hoped that in this way the nonmathematical 
reader may gain a more adequate conception of the real meaning 
of the results. 

The most useful general method of acquiring knowledge of dynamic 
phenomena is unquestionably the experimental method. When we 
deal with phenomena of human biology, there is a wide range of 
matters in which the laboratory experimental method is, irt the nature 
of the case, ruled out. Unfortunately, one can not breed homozygous 
strains of men at will for experimental purposes, nor subject them 

1 Papers from the Department of Biometry and Vital Statistics, School of Hygiene and Public Health, 
Johns Hopkins University, No . 21. 

-Pearl, B., Influenza Studies. I. On Certain Genera! Statistical Aspects of the 191? Epidemic in 
American Cities: Public Health Deports, vol. 34, No. 32, pp. 1743-1783, 1919. Reprint No. 54S, pp. 1-43. 
All citations in this paper are in terms of the pagination of the reprint. . 

'It is a pleasure to express my indebtedness, for help with the extremely laborious computation of 
multiple correlation coefficients used in this work, to my assistants Mr. John Rice Miner, Mrs. Charmian 
Howell, and Miss Agnes L. Bacon. 
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methodically to desired environmental conditions. In studying 
most problems of human biology resort must be had to some form of 
the statistical method. This is fundamentally a descriptive method, 
and hence in many of its phases is ill adapted to the analysis of 
dynamically active events. There is, however, one branch of the 
statistical calculus which offers certain methodological possibilities 
which I think have been generally overlooked. Let us consider this. 

The essence of the experimental method, as practiced in the 
laboratory, and in theory, is that, of the multitude of variables 
conditioning a phenomenon, as many as possible are, by appropriate 
methods, held constant while one variable (or at most a very few 
selected variables) is allowed to vary and the results are noted. One 
may then deduce the relative significance of the selected variable 
in determining the phenomenon under observation. Now we 
frequently hear in scientific discussions about the experiments that 
Nature makes. Actually the true conditions of an experiment are 
rarely if ever realized in the course of natural events. It is just because 
Nature permits manifold and haphazard changes in all variables at 
the same time that recourse inust be bad to the method of experi- 
mental control in the laboratory. What is needed in order to 
interpret the results, in the experimental sense, and determine the 
meaning of the manifold and ceaseless changes and variations in the 
flow of naturally determined events, is some method of picking out 
of the manifold some selected constant conditions of a series of vari- 
ables, and then measuring the extent and character of the variations 
in a single selected variable, whose true relative influence upon the 
phenomenon it is desired to know,, while all these other variables 
are held constant. If this can be done we shall have realized all 
the epistemological advantages of the experimental method as 
practiced in the laboratory, and have freed ourselves at the same 
time from the limitations which in so many cases inhere in the 
material itself and make the laboratory type of experimental inquiry 
impossible. In other words, we shall have let Nature perform the 
experiment, in the sense of determining the phenomena, in her 
own way, while we evaluate the results in critically analytical terms of 
precisely the same sort and meaning as those in which we evaluate 
the results of a laboratory experiment. 

Now exactly this epistemological boon is actually afforded in the 
method of partial or net correlation, if properly handled. This 
analytically simple, if geometrically complex, calculus enables one, 
out of a manifold complex of variables operating in an entirely 
uncon trolled and natural maimer, to determine the variation of any 
selected single variable, or the correlation of any selected pair for 
constant conditions or values of the other variables in the complex. 
I judge that the epistemological possibilities of this method are not 
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yet fully grasped by scientific men generally. When they are I 
believe it will rank as a fundamental method of acquiring knowl- 
edge, combining certain of the advantages of the descriptive or 
historical and of the experimental methods. It seems to mo much 
more effectively to justify Eoyce's * eulogy of the statistical method 
than any of the arguments which ho advanced. The best elementary, 
but at the same time adequate, account "of the method is that of 
Yule. 5 

The problem with which this paper has to do may be stated in 
this way: It is an obvious fact that the largo American cities varied 
enormously among themselves in respect to the explosivcness of 
outbreak of epidemic mortality in the autumn of 1918. What 
factors, environmental or other, were significant in determining or 
influencing this variation? Or, put in another way, what factors 
of the numerous other respects in which these 34 large, cities differ 
from one another can be shown to be significantly correlated with 
the observed differences in explosivoiiess of outbreak of epidemic 
mortality ? 

II. Variables Discussed. 

The variable phenomena or attributes discussed in the present 
paper are listed below, together with the subscript numbers by which 
they will be designated in this and subsequent papers in this series. 

Subscript No. Variable. 

I. Explosiveness of outbreak of epidemic mortality as measured by an epidemieiry 

index I 8 . 
?>&. Normal death rate from pulmonary tuberculosis. 
Hb. Normal death rate from organic diseases of the heart, 
lie. Normal death rate from acute nephritis and Bright's disease. 
I'd. Normal death rate from typhoid fever. 
:Se. Normal death rate from cancer and other malignant tumors. 
'il. Normal death rate from all causes. 

4. Age distribution of population. 

5. Sex ratio of population . 
(5. Density of population. 

7. Latitude. 

8. Longitude. 

'J. Rate of growth of population, 1900-1910. 

In the following paragraphs these variables are defined and dis- 
cussed in detail. 

1. In the first of these Studies a number of indices of explosive- 
ness of outbreak of epidemic mortality were considered and the one 
finally adopted, 7 3 , was called the peak-time ratio and defined as 
follows: 

T P- M 
^ T— 

' Hoyce, J., The Mechanical, tho Historical, and the Statistical: Science, N. S. 1314. 

» l r ule, Q. Udny, Au Introduction to tho Theory of Statistics, Fifth Edition, pp. 228-203, London, liH'J. 
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where P denotes the maximum peak mortality rate observed during 
the duration, T, of the epidemic, which was defined as follows: 
"The epidemic mortality was considered to have begun in any 
city on the date when the mortality curve for that city first passed 
outside the range of fluctuation exhibited by the curve between the 
week ending July 6, 1918, and the end of the week immediately 
preceding the epidemic rise of the curve. The mortality of the 
first epidemic outbreak was considered to have ended on the date 
when the curve again passed within the same range of fluctuation." 
M' is the mean weekly annual death rate in the period from July 6, 
1918, to the outbreak of the epidemic. '■ 

This index, I :> , increases as the explosivencss of the outbreak in- 
creases. After the publication of the paper, however, it was pointed 
out by Dr. W. H. Frost and Mr. Edgar Sydenstricker, of the United 
States Public Health Service, that it was open to some criticism as a 
measure of explosivencss of epidemic outbreak, in the strictest sense 
of the term. The point of criticism was that inasmuch as T included 
the whole time within which the mortality curve was outside the nor- 
mal range, the value of the index would be influenced by both the 
ascending and descending limbs of the epidemic curve; whereas if it 
is strict cxplosiveness of outbreak that we wish to measure, only the 
ascending limb is of moment. Reflection shows that the point is 
well taken, and consequently in the present study we have used an 
index. /„, for which the symbolic expression is: 

P - M' 

~ 7" '"' 

Hero the letters have the same significance as before except that T 
is the number of weeks elapsed between (a) the date when the mor- 
tality curve first passed outside the range of fluctuation exhibited by 
the curve between the week ended July 6, 1918, and the end of the 
week immediately preceding the epidemic rise of the curve, and (b) 
the week in which the mortality curve attained its first epidemic peak. 
In other words, there is now included in the epidemicity index only 
the ascending limb of the epidemic curve. As a matter of fact, in 
the American cities here dealt with, little practical difference is made 
in any conclusions regarding the epidemic whether one uses I 6 or I K , 
but there can be no question that theoretically 7 8 is the superior 
value, and consequently wo have used it in this part and the follow- 
ing parts of these Studies. 

2. The subscript 2 refers to total destructiveness of the epidemic, 
a variable not discussed in the present paper, but defined in the next 
following of these Studies. 

3. The subscript 3 refers to a normal death rate in the community 
from one of the causes specified by letters. In the earlier study the 
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doath rate of the community for a single year was taken as indicative 
of the normal for the purposes of that work. In the present study 
wo have taken instead the mean annual death rate from each of the 
specified causes for the three years 1915, 1916, and 1917. It is beyond 
question that the throe-year average will give a much more accurate 
representation of the! prevailing normal rate of mortality from each 
of those diseases in the community just preceding the epidemic than 
will the rate for any, single year. 

4. Age distribution of the population. In the first of these Studies 
there was used, as a single numerical index of the age distribution, a 
quantity which measured the extent fo which each city deviates in 
the age constitution of its population' from a fixed- standard, but did 
not tell the nature or kind of deviation. Since the publication of 
that paper there has been devised a new and more adequate index 6 
of the ago constitution of the population. Wo have adopted as an 
age-constitution index the function 



<*>=#{|}( 3/- i¥ p ) 



where A is the deviation for each of six ago groups (viz, 0-4,, 5-14, 
15-24, 25-44, 45-04, 65 and over) of tho percentage of the actual 
population of each city in 1910 in each age group, from tho percentage 
in the same group in the standard population of Glover's 7 life table, 
denoted in the formula by P; S denotes summation of all six values; 
3/= mean age of living population in any community; M v =mean age 
of persons in a stationary population unaffected by migration and 
which, assuming tho mortality rates of Glover's life table, would 
result if 100,000 persons were born alive uniformly throughout eaeh 
year ( M p calculated from L x lino of Glover's table (p. -16), =33.796 
years) . 

This procedure simply multiplies our former index x 2 by the 
difference (given its proper sign) between the mean age of the 
observed population and the mean age of the standard population 
on the basis of which x 2 was calculated. Since, infact, the' mean age 
of any actual urban population is. never likely to be ae great as 
the mean age of the stationary population chosen as a standard of 
reference, the actual values of # will practically always be negative 
for cities. The smaller these negative values are numerically, the 
greater will be the proportion of older persons in the population 
concerned. In short, this function <$> tells us not only the degree 
to which a given population deviates in its age distribution from a 
fixed standard age distribution, but also the nature of this deviation, 
whether on the one hand in the direction of a relative excess of 

«Ct. Pearl, R., On a Single Numerical Index of the Ago Distribution of a Population: Pro?. Nat. Acad. 
Sci., vol. 6, pp. 427-431, 1920. 
7 Glover, J. W., United States Life Tables, 1910, Bureau of the Census, 1916. 
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aged, or on the other hand in the direction of a relative excess of 
the young. Theoretically it is possible for two populations differing 
from one another in a compensatory way to give the same values 
for the index <j>. But two populations which differ in age distribution 
in any fundamental respect which could affect appreciably crude 
death rates will, in all populations I have been able to test, give 
different values of <j>, provided the age classification from which 
the function is calculated is finely enough divided. 

5. As an index of the sex distribution of the population, the male 
sex ratio was expressed as the ratio of males to 100 females in 1910. 

G. The density of population in each city was calculated from 
data furnished in the "Financial Statistics of Cities," issued annually 
by the Bureau of the Census, and was expressed as the number of 
persons per acre of land area within the legally defined limits of 
the city. 

7 and 8." In the earlier study the geographical position of the 
cities was indicated by .the linear distance of each from Boston, as 
measured : on the map. This was recognized as a very rough 
approximation. The interest in having some measure, in a study 
of this kind, of geographical position is twofold: First, that which 
arises from purely epidemiological considerations, namely, as afford- 
ing, in relation to time, an index of the rate of spread of an infectious 
epidemic disease from a primary focus; and secondly, the fact that 
geographical position, especially latitude, is a rough but on the 
whole fairly accurate index of general climatological conditions. 
It was decided to make the expression of geographical position 
more accurate in the present study, and consequently there were 
included as definite variables the latitude and longitude of each of 
the cities considered. 

9. Rate of growth of population 1900-1910. The reason for the 
inclusion of this variable in the study w T as twofold. Primarily 
this may be taken as a rough but probably fairly accurate index of 
the degree of industrialization of a city. In general, those cities 
which are growing most rapidly in population are those in which 
the most rapid industrial development is taking place. It would 
be much better if the rate of growth of the population between 
1910 and the outbreak of the epidemic could have been used; but 
accurate data are lacking, nor will they be available until the results 
of the 1920 census are published. Consequently this variable must 
be regarded as a rough approximation to one that we should like 
to measure more accurately, namely, the present state and recent 
rate of industrial development. In the second place, rate of growth 
is a definite biological characteristic of a population, 8 and as such 

a CJ. Pearl, B., and Reed, L. J., On the Bate of Growth of the Population of the United States since 
1790 and its Mathematical Representation: Proc. Nat. Acad. Sei., vol, 0, pp. 275-288,. 1920. 



279 



February 18, 1021. 



worthy of inclusion in any study relating an epidemic disease to 
demographic conditions. 

III. Data. 

The actual data used in the correlations are given in Table 1. 
Since in other work it was desired to correlate destructiveness of 
the epidemic, as measured by the 25-week excess mortality, with 
the other variables, only 34 cities could be used, because only for 
that number are the excess mortality figures available. 

Table 1. — Data/or correlation of 'charaeterktics of cities with explosivcness of epidemic 

influenza mortality. 

(The subscript numbers at the heads of columns correspond to the list of variables given on p. 275.) 



City. 



Albany, N.Y 

Atlanta, Ga 

Baltimore Md 

Boston, Mass 

Buffalo, N.Y...:.... 

Cambridge, Mass 

Chicago, 111 

Cincinnati, Ohio..... 
Cleveland, Ohio. 

Columbus, Ohio 

Dayton, Ohio 

Fall River, Mass 

Grand Rapids, Mich 

Indianapolis, lnd 

Louisville, Ky 

Los Angeles, Calif. . . 

Lowell, Mass 

Milwaukee, Wis 

Minneapolis, Minn... 

Nashville, Tenn 

Newark, N.J..... 

New Haven, Conn... 
New Orleans, La*. . .. 

New York. N.Y 

Oakland, Calif 

Philadelphia, Pa 

Pittsburgh, Pa 

Providence. R.I 

Rochester, N. Y 

St. Louis, Mo 

St. Paul, Minn 

San Francisco, Calif. 

Toledo, Ohio 

Washington, D. C . . . 



32.2 
5.3 
43.4 
28.8 
21.1 
17.9 
13.2 

9.2 

17.7 
11.0 
28.8 
27.8 
2.5 
8.6 
21 

10.0 

24.7 

4.9 

4.1 

41. 5 

14.1 

11.1 

34.1 

12.8 

15.1 

47.9 

21.3 

14.0 

13.9 

5.4 

4.5 

25.4 

17.9 

30.7 



3a. 


3b. 


3c. 


224.5 


236.4 


180.1 


113.(1 


121.0 


159.2 


202.7 


195.9 


179.0 


14). 8 


211. 5 


101.0 


143.0 135.9 


125.0 


178.8i 183.9 


73.6 


139.5' 159.1 


110.4 


206. Si 202. 5 


162.0 


132.8, 118.8 


90.0 


131.8 1.55.2 


88.0 


139.2 1st; 3 


103.2 


139.6' 164.0 


108.5 


76; 3 145.4 


94.7 


164.2 185. 7 


1U.1 


160.1 


161.7 


161.1 


182.1 


153. 


103.8 


112.5 


157.7 


90.7 


82.4 


100.7 


73.7 


120.4 


114.1 


101.3 


183.3 


193. 7 


125.1 


151.0 


143.3 


137.1 


10'. « 


185.4 


125. 1 


270. 9 


224.4 


245.6 


160.5 


165.3 


128.3 


98.7 


187.6 


87.7 


109. i) 


203.5 


173. 3 


115.9 


134.5 


94.3 


133.3 


152.8 


130.2 


95. 7 


203.1 


138.0 


132. 2 


141.3 


171.9 


KB. (i 


US. 9 


79.7 


170.0 


237. 2 


132. 5 


m 7 


185. 2 


8S.7 


185. 5 


235.1 


176.2 



3d. 



10.8 

19. 5 

18.1 

3.9 

10.3 

2.7 

4.1 

4.8 

7.0 

11.4 

17.3 

14.3 

19.0 

17.1 

14.2 

4. - 

11.2 

8.8 

6. 

30.0 

4.3 

13.0 

22.3 

4.0 

4.4 

6.9 

10.4 

6.4 

4.7 

8.1 

5.0 

5.7, 

15.0 

12. 7i 



3e. 



3f. 



137.61. 
66. 5 I! 

107. si; 

117.01 

97. 8 1, 
118.81, 

88.81. 
111.51, 
; 82.41, 
105.1,1, 
107. 7|1, 

83.91 

98. 4 1 



95. 3 
83.5 

103.5 
79.8 
85.3 
93.8 
83.3 
88.5 

112.1 
95. 



85. 0|1 
94. 21 
97. 6|l 
90. 7|1 
99.111 

109. 8|i: 

97.31 

86: 21 

138.0,1 

103. 9 1 

110. 51 



917. 
Ml. % 
810. 3 
651. 6 
533. 4 
354.4 
457. 1 
018.0 
432. 1 

498. 3 
492. 7 
639. 4 
258.8 
550.2 
5(2. 1 
258.2 
681.9 

214. 4 
189. 4 
685. 7 
412.2! 
630. 0' 
985.0 
384.7 
082. 8 • 
630. 1| 
094.0! 
527.0- 
453. 8,- 
458. 
093.0' 
546. 5 ! - 
705. l! 
795.6- 



-10.73 
-82.71 
-31.81 
-31.05 
-48. 61 
-30. 20 
-68. 73 
-22. 91 
-74. 51 
-29.08 
-24. 10 
-73. 39 
-27. 95 
-24.25 
-3M9 
-13.61 
-34. 91 
-62. 17 
-55. 39 
-48. 32 
-62.54 
-33.70 
-52.44 
-74. 42 
-17.18 
-34. 27 
-71. 77 
-30.05 
-20.62 
-44.57 
-08. 95 
-34. 34 
-33.21 
-20.67 



92.9 

92.711. 

92.430. 

96. 7 27. 
100. 13. 
91. 7 28. 



103.3 
95.4 9 
103. 6 20 
101. 5 15 
101. 9 12 
93.4 5 
97. 4 11 



9K 7 10. 

91. 1 16. 
105.9 2. 

94. 1 13. 
102. 8 26. 
139.211. 

89.6|10. 

99. ! 27. 

99. 7 15. 



■!)?. 8 
99.9 

108. 7 
95. 4 

105. 



93.7-22. 
98. 718. 
1.01. 5 ! 19. 






103. 
131.6 
101. 1 
91.3 



89 42. 65' 
4233.73 
57,39.28 
36 42. 30 
97 42. 88 
23|42.38 
28,41. 88 
1038. 14 
0841.'50 
1840.00 
65 39. 73 
9141.70 
85 42. 97 
96!39. 07 
01 138.20 
40]34. 08 
63 42. 65 
92 45. 05 
27,44. 97 
11 136.15 
52'40. 75 
03 41. 32 
9)29.97 
54 ! 40. 71 
4137.75 
02139. 95 
81 40. 43 

35 41. 81 
62 43. 13 

36 38. 63 
4944. 87 
55 37. 79 
9141.77 
55 38. 89 



73. 75' 
84.33 
73.02 
71.06 
78.92 
71.13 
87.00 
84.42 
81.70 
83.00 
84.18 
71.15 
85. 70 
86.13 
85.70 

118.20 
71.32 
87.95 
93.30 
86., 80 
74.17 
72.92 
90. 08 
74.00 

122.33 
75.17 
80.03' 
71.40 
77. S5 
90.20 
93. 08 

122. 43 
83.* aii 
77.03 



0.5 
72.3 

9.7 
19.6 
20.2 
14.1 
28.7 
11.6 
46.9 
44.6 
36.6 
13.8 
28.6 
38.1 
9.4 

211.5 
11.9 
31.0 
48.7 
36.5 
41.2 
23.7 
18.1 
38.7 

124.3 
19.7 
18.2 
27.8 
34.2 
19.4 
31.7 
21.0 
27. S 
18.8 



IV. Demographic and Environmental Correlation. 

Wc come now to the consideration of the results. The net influence 
of the several demographic and environmental factors upon explosive- 
ness of epidemic outbreak may be first considered. As we are ob- 
viously interested in getting at the net influence of each factor, such as 
age distribution of the population upon variation in the epidemicity 
index, while all the other factors for which we have data are held to 
constant values, we may pass at once to the fifth order correlation 
coefficients, without stopping for 'detailed consideration of the lower 
order coefficients leading to the final values. Such points as do need 
discussion in connection with these lower order coefficients will be 
brought out in connection with the fifth order results. The net 
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correlations between epidemicity index (subscript 1) and each of the 
six demographic and environmental factors taken one at a time, with 
the other five held constant, arc exhibited in Table 2. It is to.be 
understood in this, and all subsequent papers in this series, that 
subscript numbers to the right of the decimal point denote variables 
held constant, and subscript numbers to the left of the point denote 
the variables correlated, r is, of course, simply the conventional 
symbol of a correlation coefficient. 

Table II. — Net correlation of explosiveness of outbreak (7 6 ) with various demographic 

and environmental factors. 



Variable correlated with cxplosheiicss (,l c ). 



Age distribution of population H. 56789 

Sex ratio ol population .. 15, 46789 

Density of population , 16. 45789 

Latitude of city 17.45689 

Longitude of citv i'. 18.45679 

Rate of growth of population, 1900-1910 : 19.45678 



r subscripts. 



Coefficient. 



+0.281 ±0.107 
-0.001 ±0.116 
+0.099 ±0.115 
-0.389. ±0.100 
-0.085 ±0.115 
-0.288 ±0.106 



Taking the several variables in order Ave note: 

1. Explosiveness of outbreak of epidemic mortality can not be 
positively asserted to be significantly correlated with the age distri- 
bution of the population. The coefficient +0.281 is less than 3 times 
its probable error. The plus sign means, having regard to the method 
of calculating age indices explained above, that so far as there is any 
correlation, high values of the explosiveness index 7 6 tended to be 
associated with populations having a higher proportion of older per- 
sons than the average. No one of the lower order correlations of 
explosiveness and age index had a value as much as 3 times its 
probable error. The highest coefficient in the series was the fourth 
order r 11 . BJ „= +0.300 ±0.105. The zero order r u = +0.194 + 0.111. 
All r u correlations, whatever the secondary subscripts, within the 
group now under consideration, are positive. In the first of these 
studies (lac. pit., p. 38) the explosiveness-age correlation of zero order, 
with ihe less exact age distribution index, was r u = —0.262 + 0.101, 
again a value nearly but not quite certainly significant. With border 
line values such as these, one can only say that in cities constant in 
respect to sex ratio of population, density of population, position, 
and rate of recent growth of population, the age composition of the 
population may have a slight influence in determining explosiveness 
of outbreak of epidemic mortality, but at best the influence must be 
very small. 

2. The sex ratio of the population is plainly not significantly cor- 
related with epidemicity index. The fifth order coefficient has a value 
— 0.001 ±0.116, which is sensibly zero. This is an interesting exam- 
ple of how partial or net correlations may differ from total correla- 
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tions. The zero order coefficient between cxplosiveness (7 a ) and sex 
ratio is /•.,.= ■■ — 0.307 ±0.1.05. This is a probably significant value, 
but arises not from any direct relation of variables 1 and 5, but indi- 
rectly through the relation of both of these to the positional variables 
7 and 8 (latitude and longitude). In the group of cities here dealt 
with there is a relatively high correlation between male sex ratio and 
longitude, and between sex ratio and- latitude, when longitude is con- 
stant. These relations are shown in the following coefficients: 

f 67 = ±0.134±0.114 (sex ratio and latitude). 

r- lS = +0.678 ±0.062 (sex ratio and longitude). 

T f>v*~ ±0.607 ±0.073 (sex ratio and latitude — longitude constant). 

r 58 . 7 = ±0.808 ±0.040 (sex ratio and longitude — latitude constant). 

Because of the inverse and nearly equal correlations of epideraicity 
index with latitude and longitude, the sex-ratio correlation with epi- 
demicity index is neutralized as soon as these other variables are 
brought into the system. Thus while we have r Vt = — 0,307±0.105, 
we get r liV78 = ±0.023 ±0.116, or practically zero. We may then 
safety conclude that the proportion of males (or of females) in the 
population of a city had no sensible direct influence in determining 
the cxplosiveness of outbreak of epidemic mortality. 

3. The net fifth order correlation of cxplosiveness of outbreak of 
the epidemic mortality with density of population is again sensibly 
zero. This is true whatever the variables held constant within tire 
group here discussed. The zero order coefficient is ?' I0 = ±0.073 ± 
0.115. Nowhere in the series does a coefficient having primary sub- 
scripts 16 rise to a value even approaching 3 times the probable error. 
This result, that density of population, which is the measure of urban 
crowding in this case, had nothing to do with determining the cxplo- 
siveness of outbreak of the epidemic mortality, while surprising on 
grounds of purely a 'priori logic — always, by the way, most unsafe 
grounds — fully confirms with more critical data the res.ult attained 
in the earlier study (he. cit., p. 36). 

4. In the case of the correlation of cxplosiveness of outbreak of 
epidemic mortality with the latitude of the city, a very different 
result appears. The fifth order coefficient is r J7 . <5M3 = —0.369 ±0.100. 
This is more than three times its probable error and is probably 
statistically significant; but before drawing any conclusions about 
its epidemiological significance we must critically look into its genesis. 
In the first place it must be noted that the 34 cities with which we are 
dealing are not scattered at random over the United States. All but 
three of them arc either on or east of the Mississippi River. This 
distribution is well shown in the map exhibited as Figure 1. 

Not only are the cities mostly in the eastern half of the country, 
but, what is more important, they nearly all fall in a fairly narrow 
northeast-southwest belt. How clearly this is so is shown by the 
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regression of latitude on longitude, which is plotted on the map. The 
distribution of these cities is so far from random orer the whole area 
of the country that there is a statistically significant correlation for 
these 34 cities, between latitude and longitude, the coefficient being 
r 78 = —0.404 ±0.097. In words, what this means is that, within this 
group of 34 cities, in general, the farther north a city is the farther east 
it is. 

Now, neither latitude nor longitude is alone significantly correlated 
with the epidemicity index J 6 , as witness the zero order coefficients 

r 17 =- 0.243 ±0.109 
r 18 =- 0.229 ±0.1 10 

But because of the relatively large value of r. s which has just been 
pointed out we get at once 

%. s =- 0.376 ±0.099 (significant) 
fi 8 .7= - 0.369 ±0 100 (significant). 

The same influence makes itself felt throughout the series of ascending 
order partial coefficients. Thus, the relevant second order coefficients 
are: 



r n.4o = 


-0.208 ±0.1 11 


'*18.fo = 


-0.092 ±0.115 


r n.i« = 


-0.303 ±0.105 


''ls.ie^ 


-0.242 ±0.109 


^17.48 = 


-0.387 ±0.098 


'°18.47 = 


-0.407 ±0.097 


r n.i»~ 


-0.407 ±0.097 


r is.v>~ 


-0.074±0.115 


'"n.so = 


-0.264 ±0.108 


r i8.50 = 


+ 0.022 ±0.116 


*"l7.58 = 


-0.319±0.104 


^18.37 = 


-0.245 ±0.109 


, 'l7.50 == 


-0.358±0.101 


*'l8.!>» = 


+ 0.194 + 0.111 


^17.68 = 


-0.381 ±0.099 


^18.07 = 


-0.333 ±0.103 


'*i7.e» = 


-0.403 ±0.097 


?> 18.0'J = 


-0.061 ±0.115 


r n.s9~ 


-0.421 ±0.095 


r i».-»~ 


-0.160±0.113 



It is at once evident that most of the r 17 or latitude coefficients are 
three or more time their probable errors. . Most of the r lg , or longitude 
coefficients are, on the contrary, less than three times their probable 
errors, the only ones arising to a higher value being those carrying 7 as 
a secondary subscript and not all of those. In other words, we come 
here to a separating point between the correlations carrying 1 and 7 as 
primary subscripts and those carrying 1 and 8 in the same position. 
This divergence comes about from the different correlations of certain 
of the other variables with latitude and longitude. Thus we have, 
for sex-ratio correlations, r,-= +0.134 ±0.1 14 against r 85 = + 0.678 ± 
0.062. Age distribution index is not significantly correlated with 
either latitude or longitude in this group of cities. Density is sig- 
nificantly and about equally correlated with both, the coefficients 
being r m = -0.424 ±0.095, and r 76 = -0.371 ±0.100. Kate of growth 
in this group of cities is nearly twice as highly correlated with 
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longitude as with latitude, the signs of course being opposite. 
Tlie coefficients are r 7B = -0.365 ±0.100 and r 89 = +0.642 ±0.068. 

Without pursuing this complex trail further, certain things are 
clear. In the first place it is evident that it will be quite unsafe to 
draw biostatistical or demographic conclusions about, or on the basis 
of data from, the large American cities, withqut having critical re- 
gard for the fact here demonstrated that some of the most important 
of these characteristics are significantly correlated with the mere 
geographical position of the cities. In the second place, because -of 
this fact, we can not be quite sure of the epidemiological significance 
of the fact that, in this group of cities, there is a statistically signifi- 
cant negative correlation between epidemiological index and lati- 
tude. Taken at its face value this coefficient means that, on the 
average, the'' outbreak of epidemic mortality was more suddenly 
explosive, the farther south the city, when the other factors of ago 
distribution, sex ratio, density, and rate of growth of population 
were constant and equal. "Whether this bespeaks a real and general 
biological phenomenon resting presumably upon a climatological 
base can not be critically determined until we can study the matter in 
a group of localities distributed' in a more random manner than in 
the present sample, so that the correlations between latitude and 
longitude shall be more nearly zero in value. It is interesting to 
note, however, that the present result in respect to latitude correla- 
tion is in accord with general clinical and pathological experience. 
I am told by Dr. William II. Welch that it has long been recognized 
by clinicians and pathologists that acute respiratory infections, and 
particularly the pneumonias, tend to become more fatal as one 
passes from north to south. There is a splendid chance here for a 
critical statistical investigation of the matter, and with the rapid 
extension of the registration area into the South in recent years, 
adequate data will shortly be available. 

5. Kate of growth of population of these cities in the decade 1900 
to 1910 is a factor connected with age of city, with its industrial and 
commercial prosperity and activity, and indirectly with sanitation, 
because usually in cities growing very rapidly, sanitary arrange- 
ments tend to lag behind the need for them. The net fifth order 
correlation of this variable with explosiveness of outbreak of epi- 
demic mortality is r 10 , 4507S = —0.288 ±0.106. This is a border-line 
value, which can not be safely asserted to differ significantly from 
zero. Koughly speaking, a value as large as this would turn up 
purely by chance about 7 times in every 100 trials with samples of 
the size here dealt with. The gross, zero order coefficient between 
these variables is r 10 = -0.302 ±0.105, again not a significant value. 
No one of the partial coefficients having 1 and 9 as primary sub- 
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scripts is certainly significant in comparison with its probable error, 
except such as carry also 7 as a secondary subscript. On the whole, 
it can be safely asserted that if the rate at which a city had recently 
been growing in population had anything at all to do with the degree 
of explosiveness of outbreak of the 1918 epidemic mortality, this 
influence must have been at the most extremely slight. 

G. In general, the results of this more extended and critical study 
of the influence of these demographic and environmental factors 
confirm the earlier preliminary examination of the data. The only 
variable in the lot that has a statistically significant net correlation 
with the explosiveness index is latitude. In the former study! use 
was made of linear distance from Boston as a locality index instead of 
the more general aiid exact latitude and longitude plan here adopted. 
But the correlation result obtained with the cruder variables was 
the same in sense, and of about the same order of magnitude as the 
more refined net fifth order latitude correlations. I have already 
indicated fully the wisdom of caution for the present, in drawing 
biological conclusions from this latitude correlation. It is possible 
that the age distribution of the population and the rate of recent 
growth of the city had some slight influence upon the explosiveness 
of outbreak of the epidemic mortality; but in either case the effect 
must have been so slight as to be negligible so far as any practical 
epidemiological significance is concerned. 

V. Death Rate Correlations. 

We may turn now to another series of correlations. In the first 
study it was shown that the most significant correlations of explosive- 
ness of outbreak of epidemic mortality were with the normal death 
rates from certain primarily organic diseases. These results, because 
of their novelty and possibly far-reaching theoretical significance, have 
been thought particularly to need critical reexamination. As has 
already been pointed out, we have in the present study taken the 
mean death rate for three years as indicative of normal conditions 
instead of a single year as in the earlier work. 

The sixth order coefficients are exhibited in Table III. These 
coefficients measure the net correlation existing between the epidem- 
icity index 7,. and the specified normal death rate, when the cities 
involved have been made constant and equal in respect to age and 
sex constitution of population, to density and rate of recent growth 
of population, and to latitude and longitude. 
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Table III. — Net correlation of cxplosivemss of outbreak (7 6 ) with the normal death rates 
from certain specified causes. 



Variable correlated with explosiveness (J 6 ). 
Death rate from— 



r subscripts. 



Coefficient. 



All causes. 

Pulmonary tuberculosis 

Organic diseases of 1 he heart 

Acute nephritis and Bright's disease 

Typhoid fever... I.... 

Cancer and other malignant tumors. 



13f. 436789 
13a. 456789 
13b. 456789 
13C. 456789 
13d. 456789 
13e. 456789 



+0.572 ±0. 078 
+0.389 ±0.098 
+0.562 ±0.079 
+0.307 ±0.105 
+0.105 ±0.114 
+0.141 ±0.113 



It is at once evident that these correlations are of a generally 
different order of magnitude from those of Table II. Specifically 
wo note : 

1. The highest correlation is that for the death rate from all causes; 
hut that for organic diseases of the heart is practically identical. 
These coefficients are more than seven times the probable error and 
are certainly significant. The corresponding zero order coefficients 
arc: 

All causes, r al = + 0.678 ±0.063 
Organic heart, r, 3b = + 0.642 ± 0.0S8 
From these values, in comparison with those of Table III, it 
appears that by making all six demographic and environmental 
factors constant, the con-elation between explosiveness and the 
normal death rate from all causes or that from organic diseases of the 
heart, .is not altered to a degree statistically significant. This com- 
parison, indicates the overwhelming importance of the biological 
factor which these death rates measure in determining the explosive- 
ness of the outbreak of epidemic mortality, as compared with the 
demographic and environmental factors previously considered. 

2. Next to organic diseases of the heart, pulmonary tuberculosis 
is the single cause having its normal death rate most highly correlated 
with explosiveness of outbreak of the epidemic. The net sixth order 
coefficient for this disease, however, is distinctly lower than that 
for. organic diseases of the heart.. Furthermore it is much more 
reduced by the process of making demographic and environmental 
factors constant, as is indicated by the fact that the. zero order 
coefficient for this disease is r 13a = +0.578 ±0.077. The difference 
between this and the sixth order coefficient r 13 a.«5 8789 = +0.389 ±0.098 
is 0.189 ±0.125. While not statistically significant in comparison 
with the probable error, it comes much nearer being so than the 
corresponding difference for the organic heart correlation, which is 
0.080 ±0.104. 

3. In the case of the normal death rate from breakdown of the 
kidneys (acute nephritis and Bright's disease) the sixth order net 
coefficient is on the border line of probable statistical significance, 
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having a value just under tlirec times its probable error. Again the 
process of making the six demographic and. environmental factors 
constant has materially reduced the correlation between the normal 
death rate from these diseases of the kidneys and the cxplosiveness of 
outbreak of epidemic mortality. The zero order correlation bore is 
f 13C = +0.447 ±0.093. The difference between the zero order and 
the sixth order coefficients is 0.140. 

4. Turning to , diseases of wholly different etiology from those 
dealt with up to this point, namely, typhoid fever and cancer, the 
correlations between the normal death rate from these diseases and 
the cxplosiveness of outbreak of epidemic mortality are found to be 
of an entirely different order of magnitude. In neither case, typhoid 
or cancer, is the coefficient sensibly different from zero, having regard 
to its probable error. Clearly, whatever factors, biological or environ- 
mental, or both, arc measured by these death rates can' not have had 
any sensible influence in determining the suddenness with which the 
mortality curve rose during the influenza epidemic. It is generally 
held on good grouuds that the typhoid death rate is an excellent 
index of the general sanitary status of a community. If it may bo 
so accepted in the present connection, the result just stated bears 
out in precise mathematical terms what was obvious to the thoughtful 
and candid observer at the time of the epidemic, namely, that the 
severity with which a city was bit by the epidemic bore no relation 
to its general sanitary status or to the efficiency of its health organiza- 
tion. In this connection there is a further interesting mathematical 
point regarding the typhoid correlation. The zero order gross correla- 
tion for this death rate with 7„ is r 13d = +0.342 ±0.102. This is a 
statistically significant correlation; but observe that as we make the 
cities constant in respect to demographic and environmental factors 
the reduction in the correlation is very great/ending with the sixth 
order coefficient at a value r 13 d. 4587S 9= +0.105+0.114, a drop of 
0.237 in the coefficient. But this is exactly what would be expected 
when it is recalled to what an extent the typhoid death rate of a 
community depends upon the environmental conditions in that 
community. 

5. The normal cancer death rate is not significantly correlated with, 
the cpidemicity at any stage, nor is the correlation altered to any 
extent by making the demographic and environmental factors con- 
stant. This is indicated by the fact that the zero order coefficient 
is r , j 3e = +0.235 + 0.109, while the sixth order coefficient is lower by 
only 0.094. 

6. Taking all the results together they confirm, in general, but 
refine, those of the earlier study. It is now seen that a considerable 
part of the high correlation then found for the tuberculosis death rate 
with epidemicity index disappears if the cities are made constant and 
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equal in respect to some six important demographic and environmen- 
tal factors. The correlation is still, however, in spite of the reduction, 
significant in comparison with its probable error. Death rate from all 
causes is highly correlated under all circumstances with explosivc- 
ness of outbreak. This death rate may be taken as an index of the 
general hcalthfulness of the community. 9 But the outstanding single 
factor, which apparently more than any other one thing j*et discovered, 
determined how abruptly or explosively the mortality was to rise at 
the outbreak of the epidemic, was the normal death rate in the com- 
munity from organic diseases of the heart. This extraordinary and 
striking fact will be more fully discussed in the next paper in this 
series. 

VI. Summary. 

In this second study a further more refined and detailed analysis 
is made of the weekly mortality statistics of the influenza epidemic 
of the autumn of 1918. Using the method of multiple or partial 
correlations the attempt is made to determine more critically the 
factors chiefly responsible for the great variation exhibited among 34 
large American cities in respect to the degree of suddenness or 
explosiveness of outbreak of epidemic mortality. A new and more 
critically exact epidemicity index is described, as well as a more 
accurate single numerical index of the age distribution of a population. 
Every effort was made to get critical quantitative measures of the 
variables dealt with. In general, this more extended and refined 
study confirms the results of the fii'st. It is believed that the critical 
refinements introduced in this and the succeeding studies fully meet 
the implied criticisms of Winslow and Rogers 10 on the first studj-, 
at least so far as concerns readers capable of understanding the 
implications of the theory of probability. It is, for example, simply 
idle to assert that a surely significant correlation between death rate 
from organic diseases of the heart and explosiveness of outbreak of 
influenza may bo "due to peculiarities in age distribution of the 
population" when it is conclusively shown, as it here is, that the same 
correlation prevails ichen the age distributions of all the populations 
concerned are held constant. If the results of these studies are to be 
successfully controverted it must be upon a different basis than logic 
of this sort. Of the six environmental and demographic variables 
tested, only one, namely, the city's latitude, was found to have a 
statistically significant net correlation with the explosiveness or 
epidemicity index. In that case the biological or epidemiological 
significance of the result is doubtful, because of the peculiar non- 

8 It is so used, for example, by Brownlee, J., An Investigation into the Epidemiology of Phthisis in Great 
Britain and Ireland: Med. Re?. Cumin. Spec. Rept. Scr. No. 46, pp. 1-9S, 1920. 

10 Winslow, C.E. A., and Rogers, J. F., Statistics of the 191S Epidemic of Influenza in Connecticut: 
Jour. Inf, Dis. vol. 2T>, pp. 184-216, 1920. 



289 February 18, 1021. 

random spatial distribution of the 34 cities furnishing data. On the 
other hand, the normal death rate from all causes and from organic 
diseases of the heart are correlated to a relatively high degree with 
explosiveness of outbreak of epidemic mortality. The normal death 
rate from pulmonary tuberculosis is also sensibly, but not highly, 
correlated with explosiveness of influenza outbreak. The normal 
death rates from typhoid fever and from cancer show no sensible 
net correlation with explosiveness. 

III. ON THE CORRELATION OF DESTRUCTIVENESS OP THE 1918 

EPIDEMIC " 

I. Introduction. 

It was pointed out by friendly critics after the appearance of the 
first of these Studies 13 that the epidemiological character there dealt 
with was explosiveness of outbreak, as measured by an epidemicity 
index 7 5 subsequently modified into I as described in the second of 
these Studies 13 , and that as explosiveness of outbreak of epidemic 
mortality was obviously a characteristic which might conceivably be 
nearly or quite distinct and . independent from destructiveness as 
measured by the total number of persons killed by the epidemic, it was 
a matter of doubt what might be the real meaning of the biological 
conclusions flowing from the peculiar facts brought out in that first 
study, since confirmed and extended in the second. This criticism 
was well taken. The obvious answer to it, however, is to study 
destructiveness in the same way that we have explosiveness of out- 
break, by the method of multiple correlation. It is the purpose of 
this paper to present the results of just such a study, in which the 
extended and critically defined series of variables of Influenza Studies 
II were used. 

The problem to which this paper is addressed may then be stated 
in the following terms : The 34 large American cities for which wo have 
available published weekly data, varied enormously among themselves 
in respect to the destructiveness of the epidemic, as measured by 
the number of their inhabitants who died in excess of the normal, 
during the autumn and winter of 1918-19. What factor significantly 
influenced or determined this variation ? 

The methods by which this problem is attacked in the present 
paper are precisely the same as those of the second of these Studies, 
and the reader is referred to the introductory portion of that paper 
for the general discussion of methods used. 

11 Papers from the Department of Biometry and Vital Statistics, School of Hygiene and Public Health, 
Johns Hopkins University. No. 22. 

13 Pearl, R., Influenza Studies. I. On Certain General Statistical Aspects of the 1918 Epidemic in Ameri- 
can Cities: Public Health Reports, vol. 34, No. 32, pp. 1743-1783. 1919. Reprint No. 548, pp. 1-43. All 
citations in this paper arc in terms of the pagination of the reprint. 

'• Sec p. 276. 

28609°— 21 2 
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II. Variables Discussed. 

The variables discussed in the present paper are precisely, the 
same as those listed in Influenza Studies II (p. 275) with two excep- 
tions. These are (a) that here the variable I tt (explosiveness of out- 
break) is omitted, and (b) its place is taken by a new variable, 
destructiveness, which is indicated by the subscript 2 in all that 
follows. Destructiveness is measured by the 25-week excess mor- 
tality rates calculated and published by the Bureau of the Cen- 
sus; 14 These 25-week excess fates indicate the number of people 
dying from all causes, during the 25 weeks following the initial 
outbreak of the epidemic in this country in the autumn of 1918, 
in excess of the number who probably would have died in the 
same period had no epidemic occurred. The rates for the 34 cities 
are given in Table I (p. 12) of Influenza Studies I, and hence need 
not be reprinted here. The vajues of the other variables used in 
this paper arc given in Table I of Influenza Studies II (p. 279). 

III. Demographic and Environmental Correlations. 

Just as in the discussion of explosiveness of outbreak, we may 
consider first the net fifth order correlations of destructiveness 
with the several demographic and environmental factors for which 
we have data. The coefficients are exhibited in Table I. 

Table 1. — Net correlation of destructiveness (25-week excess mortality) with furious 
demographic and environmental factors. 



Variable correlated with destructiveness (2o-wcck excess mortality). 



Age distribution of population 



r subscripts. 



24.36789 

Sex ratio of population . I 25. 46789 

Density of population .: 1 26. '5789 

Latitude of city j 27.45689 

Longitude of city :! 28. 45679 

Rate of growth of population, 1900-1910 1 29. 45678 



Coefficient. 



+0.132±0.1U 
-t-0. 161±0. 113 
+0. 163±0. 113 
-0. 424±0. 095 
-0. 133±0. It4 
-0.083±0.115 



The results here arc clear cut. The only variable having a net 
coefficient which can be regarded as sensibly different from zero is 
latitude. There the coefficient is certainly statistically significant. 
Taken at its face value, and remembering the significance of the 
negative sign, this means that the farther south the city the greater 
the rate of mortality in excess of the normal. Or put in another 
way, the coefficient indicates that the influenza epidemic had a 
definite, though at best not marked, tendency to be more destructive 
of life in southern than in northern latitudes. Too much stress 
must not be laid upon this result, however, because of the peculiar 
spatial distribution of this group of 34 cities. This point has been 
fully discussed in the preceding study of this series and need not be 

" CI. Public Health Reports, vol. 34, No. 11, p. 505, 1919. 
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dwelt on again hero. The same reasons exist for suspending judg- 
ment as to the full biological significance of the apparently definite 
correlation between destructiveness of the epidemic and latitude, as 
were emphasized in the case of the similar correlation between ex- 
plosiveiiess of outbreak and latitude. 

The remaining coefficients arc very definitely so small as to leave 
no doubt about their meaning. Throughout the long process of 
making more and more variables constant and getting the .successive 
higher order net coefficients, no one of these demographic characters, 
except latitude, ever showed a value sensibly differing from zero. 
This fact is well indicated by the zero order coefficients, which are as 
follows: 

■»V=+Q..024 ±0.116 (age), 

»»'=.- 0.029 ±0.116 (sex).' 

r M =-+0. Ill ±0.'ll4'(density). 

%=-0.325'±0. 103 (latitude). 

r» = +0. 0007 ±0. 116 (longitude). 

,^9= -0. ; 071 ±0. 11 5. (growth). 

It can, then, bo safely assettcd that in the determination of the 
variation among these 34 large American cities in respect to the excess 
mortality due to the, epidemic, the age and sex distribution of the 
population, its degree of crowding (not its rate of recent growth), or 
the distance of the city west from the Atlantic seaboard, played iio 
appreciable part whatever. This conclusion is true whether all 
these factors were allowed to vary together, or whether, as in a 
laboratory experiment, one to five of them were hold constant, while 
the net influence of one or more varying alone was tested. 

Whether or not the same conclusion will hold generally for other 
cities of a different order of size, or for rural districts, remains to be 
shown by further work. But as to the facts for the 34 cities listed 
in Table I of the second of these Studies, there can be no doubt or 
argument. 

IV. Death Eate Correlations. 

Turning to the same set of normal death rate variables as were 
studied in connection with explosiveness we have the results set 
forth in Table II. 

Table II.-— Net correlation of destructiveness (,iii-icert: evcesn niorkilili/) with, the normal 
death rate from certain specified causes. 



Variable correlated with destructiveness (25-week excess mortality). 
Death rate from— 


r subscripts. 


Coefficient. 




231". 453739 
23a. 15378'.) 
23b. 458789 
23c. 4337X9 
23d. 4557S9 
230. 433789 






+0. 070 j. (i K17 




+0.537 ±0.082 
-0. OIK -HU If. 
—0.133± 0.1i:i 




+0.208 J- 0.107 
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From this table. wo note the following points: . 

1. Epidemic excess mortality is significantly correlated with the 
normal death rate from all causes in these cities. This is true in 
gross (r,,. if = +0.435 + 0.094) and also when the environmental and 
demographic factors listed in Table 1 above arc held constant, as m 
an experiment. In other words, the number of people dying during 
tire epidemic in each of these 34 cities- was determined to a significant, 
though not a high, degree by the usual; mortality relations of the 
community, as indicated by the normal death rate from all causes. 
Cities which have normally a high .death rate had also a relatively 
high mortality from the influenza epidemic, and vice versa those 
normally enjoying a relatively low mortality rate lost, but relatively 
few persons in. the epidemic. It is to be noted, however, that while 
the net coefficient r M!AM7s!l has a value more than 4 times its probable 
error, and therefore is to bo regarded as certainly significant statisti- 
cally, yet this correlation is lower than the corresponding one for 
explosiveness of outbreak and death rate from all causes (cf. Table 
III, Influenza Studies II). We have: 

Explosiveness, r v)Ltr>fnm =* + 0.572 ± 0.078 
Destructivencss, r 23f , 4 ,- )67g9 = +t).405 ± 0.097 



Difference 0.167 ±0.124 

While this difference is not significant in comparison with its probable 
error, nevertheless there is considerable probability that with a larger 
sample it would become so. It thus appears that the explosiveness 
of outbreak of the epidemic mortality was, perhaps, somewhat more 
influenced by the normal mortality rate of the community than was 
its total magnitude or destructivencss. 

2. The total destructiveness of the epidemic is not significantly 
correlated with the normal death rate of the community from pul- 
monary tuberculosis when the important demographic and environ- 
ment ai factors listed in Table 1 are' held constant. Here we come 
upon a distinct break between the two epidemiological characteris- 
tics, explosiveness and total destructiveness. The former is, and the 
latter is not, significantly correlated with the normal tuberculosis 
death rate. A difference of the same sense is evident in the zero order 
gross correlations, which are r 13a = +0.578 ±0.077 and r, 3a = + 0.428 ± 
0.094. Both of these gross values are more than 3 times the probable 
error, but oWe a considerable portion of their high values, as is dem- 
onstrated by the sixth order coefficients, to intercorrelations with 
other variables. 

3. The highest net correlation of destructiveness of the epidemic 
is with the normal death rate from organic diseases of the heart. 
When all of the demographic and environmental factors with which 
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we arc dealing are held constant for these cities, we find a correlation 
of +0.537 ±0.082, a value nearly 7 times its probable error, between 
these two variables. The coefficient is higher than that for the nor- 
mal death rate from all causes with destructiveness. It appears very 
clearly that of the 12 different factors here studied the normal death 
rate of the community from organic diseases of the heart had more 
to do with determining the proportionate part of the population 
which perished in the epidemic than any other factor. In those cities 
having normally a high heart-disease rate a relatively large number 
died in the influenza epidemic, and vice versa. The same thing was 
shown in the preceding study to be true for explosiveness of outbreak 
of epidemic mortality. The condition of the population in respect 
to cardiac soundness played a significant role in determining the sud- 
denness and frequency with which people died during the autumn 
and winter of 1918, when the 34 large American cities were struck 
hj the influenza epidemic. It is interesting to note that the net 
sixth order correlation coefficient is higher in the case of organic 
diseases of the heart than the gross zero order coefficient, which is 
} Mb = +0.487 ±0.08:8. This means that in the gross, or zero order 
coefficient, the true organic relationship existing between destruc- 
tiveness of epidemic and normal cardiac death rate is obscured by the 
fact that there is a high correlation between the latter variable and 
the age distribution of the population 0. jbl = +0.609 + 0.073), the 
meaning of this coefficient being, of course,, that the higher the average 
age of a population, the higher the death rate from organic diseases 
of the heart, and vice versa.— & relationship which would be expected 
a priori from what we know about cardiac affections. As soon as we 
make the cities all constant in respect to age distribution of the 
population, we get a marked increase in the correlation coefficient 
between epidemic destructiveness and normal cardiac rate (r 2 ,.t V( = 
+ 0.596 ± 0.075) . This is an increase of 0. 109 in the coefficient. But , 
as there is also a sensible negative correlation in this group of 34 
cities between normal death rate from organic heart diseases and lat- 
itude 3b . 7 = -0.300 ±0.105), the high value of f 23b . 4 = +0.596 + 0.075 
is reduced somewhat when the cities are made constant in respect 
to both age distribution and latitude, the coefficient being r nhA! --^ 
+ 0.549 ± 0.081. The other demographic and environmental variables 
have only negligible effect upon the r 23b correlation. 

4. The net correlation between destructiveness of the epidemic and 
the normal death rate from diseases of the kidneys is sensibly zero. 
Here again there is a marked contrast between explosiveness of 
epidemic mortality and destructiveness (cf. Table III, Influenza 
Studies II). The gross zero order correlation between destructive- 
ness and normal death rate from kidney diseases (r i30 ~ +0.282 + 
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O.IOC) is less than throe times its probable error, as well as the net 
sixth order coefficient. 

5. The destructiveness of the epidemic is not significantly corre- 
lated with the normal death rate from either typhoid fever or cancer, 
either net, when all the demographic and environmental variables are 
hold constant, or in gross (r 23 d=-+0.014±0.11G, and r 2 ., e = + 0.215 ± 
0.110). We observe the same contrast in the correlations of these 
diseases with destructiveness as we did in the previous study, when 
they were correlated with explosivcness, in comparison with the 
correlation of normal death rate from, organic diseases of the heart 
with these same epidemiological characteristics. 

V. Summary. 

In order to meet a jus tillable criticism of the earlier work we have 
determined in this paper the correlations between destructiveness 
of the influenza epidemic of 1918-19, as measured by the 25-week 
excess mortality in. the 34 cities used in the earlier work, and the 
scries of demographic, environmental, and biological (normal death 
rate) variables discussed in Studies II. It is found that there is, 
in the group of large cities, no significant net correlation between 
destructiveness of the epidemic as above defined, and any demo- 
graphic or environmental variable, with the exception of latitude. 
In that case, for reasons fully discussed in Studies II, the biological 
meaning to be attached to the result is not entirely clear. The 
highest net correlation between destructiveness and any of the 12 
variables discussed was with the normal death rate from organic 
diseases of the heart. Those cities, within tins group of 34, which 
have normally an unusually high death rate from cardiac disorder, 
had an unusually high epidemic mortality, and vice versa. There 
was no significant net correlation between destructiveness of the 
epidemic and the normal death rate from acute nephritis and Bright's- 
disease, nor with that from pulmonary tuberculosis. The correla- 
tion results indicate clearly that explosivcness of outbreak of epi- 
demic mortality and total destructiveness of the epidemic are distinct 
but related epidemological characteristics. 

IV. ON THE CORRELATION BETWEEN EXPLOSIVENESS AND TOTAL 
DESTRUCTIVENESS OP THE EPIDEMIC MORTALITY. 15 

The results of the second and third of these Studies indicate 
clearly that explosivcness of outbreak of epidemic mortality (as 
measured by the epidemicity index I e ) and total destructiveness of 
the epidemic (as measured by the 25-weck excess mortality) are 
distinct epidemiological characters. The question then presents 
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itself as to how closely these tAvo characters are correlated. To the 
answer of this question the present paper is addressed. 

For the 34 large American cities furnishing the material set forth 
in Table I of the second of these Studies, 16 we have data on the 
following variables : 

Subscript No. Variable. 

1. Explosiveness of outbreak of epidemic mortality. T . 

2. Destruetiveness (25--week. excess mortality). 

3a. Normal death Tate from pulmonary tuberculosis. 

3b. Normal death rate from organic heart diseases. 

3c. Normal death rate from acute nephritis and Blight's disease. 

3d. Normal death rate from typhoid fever. 

3e. Normal death rate from cancer and other malignant tumors. 

3f. Normal death rate from all causes. 

4. Age distribution of population. 

5. Sex raiti'o of population. 

6. Density of population. 

7. Latitude. 

8. Longitude. 

9. Rate of growth of population, 1900-19 10. 

The gross or; zero order correlation between explosiveness of 
epidemic mortality (I e ) i and total destruetiveness (25-vvcek excess, 
mortality) is 

n 3 =+ 0.709 ±0.058. 

This obviously represents a relatively high, but by no means 
perfect, correlation. In general, it means that cities having an inci- 
dence of epidemic mortality more sudden and explosive in its out- 
break than the average were highly likely to have a total mortality 
from the epidemic above the average, and vice versa. 

It is essential, however, just as in earlier parts of these Studies, 
to find the net value of this correlation when the various environmen- 
tal, demographic, and biological variables listed at the beginning of 
this paper, are held constant as in an experiment The initial step 
in such a process is to calculate the first-order coefficients, where 
successively, one at a time, each variable in the series is held con- 
stant while the correlation between variables 1 and 2, in which wo 
are interested, is determined. 1 In Table I are given these first-order 
correlation coefficients between. the variables 1 and 2 (explosiveness 
and destruetiveness). 



"Seep. 279. 

1 For a general discussion of (lie method or multiple correlation in Statistical work of this sort the reader 
isreterred to these Studies, IT, pp. 272-275. 
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Table 1. — First-order correlations between explosiveness (7 6 ) and destructiveness (25 
week excess mortality) of the 1918 influenza epidemic in 34 American cities. 



r subscripts. 


Coefficient. 


12.3a 


+0.626±0.070 


12.3b 


+0.S92±a<»75 


12.3c 


+0.679±0.062 


12.3d 


+0.750±0.051 


12.3c 


+O.694±O.06O 


12. 3f 


+0.626±0.070 


12.4 


+0.718±0.056 


12.5 


+0.738±0.053 


12.6 


+0.707±0.058 


12.7 


+0.687±0.061 


12.8 


+0.729±0.0S1 


12.9 


+O.723±0.055 



It is evident from this table that making any single one of the 12 
variables constant lias little effect upon the correlation between 
explosiveness and destructiveness of the epidemic mortality. The 
coefficients are all relatively high. 

Let us examine the effect of making all the demographic and 
environmental factors (subscripts 4 to 9, inclusive) constant at the 
same time. The coefficient is 

*•„.«•»=+ 0-706 ±0.058. 

This is almost absolutely identical with the zero order gross coeffi- 
cient given above. This result means that the factors ago and sex 
distribution, density and rate of recent growth of the population, 
latitude and longitude of the city (with all implied climatic differ- 
ences), have no influence in determining the correlation between 
explosiveness and total excess mortality rate. 

Taking in the biological (normal death rate) variables we have the 
following seventh-order coefficients: 

r IMWiM ,= + 0.675 ±0.063 
r„.,b« m »=+ 0-579 ±0.077 

^.2.30,56789=+ 0.71 i± 0.057 

»n.3d 4M 789= +0.749 ±0.051 

r 1Mwlt7I ,= + 0.700 ±0.059 

»"„.*«,„,= +0.632 ±0.069 

It is obvious that these normal death rates influence very little, 

either one way or the other, the correlation between explosiveness 

and destructiveness of the epidemic outbreak. 

One point of considerable interest which attaches to the rela- 
tively high correlation between these two variables is that one of the 
variables is measurable in time long before the value of the other 
can be possibly determined. The value of the explosiveness index 
/„ can usually be determined in from 2 to 4 weeks after the beginning 
of the epidemic, while the total excess mortality can only be measured 
when the epidemic is all over. With a correlation coefficient of the 
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magnitude of r 13 above, one can, by means of the regression equation, 
make a very fair prediction of the total excess mortality rate from a 
knowledge of the explosivcness of outbreak of the mortality measured 
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by /,.. Actually, as the different net correlations show, we shall, get 
practically as good a result by using the zero order coefficient and a 
simple equation of the form y = a + bx as by employing a many- cor- 
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stant partial regression equation. Actually the regression equation 
from the zero order r l2 is - 

# = 2.7412 + 0.1065 /„ 
where D denotes 25-week excess mortality rate and 7 is the epi- 
demicity or cxplosiveness index. This regression line is shown 
graphically in Figure 2. 



SUGGESTED HEALTH PROVISIONS FOR STATE LAWS RELAT- 
ING TO CHILDREN. 

Report of the Advisory Committee to the National Child Health Council on Health 
Provisions for Laws Relating to Children. 

In view of the fact that a number of States have children's code 
commissions at work, which arc dealing with various aspects of child 
welfare, it is especially desirable to call attention at this time to the 
need for child-health provisions in order that they may receive the 
consideration that they merit. Therefore, there is presented here a 
report of the advisory committee to the National Child Health 
Council on Health Provisions for Laws Relating to Children. 

The advisory committee is composed of the following members: 

Courtenay Dinwiddie, chairman, executive secretary, National Child Health 
Council, Washington, D. ('. 

James A. Toboy, secretary, assistant director, department of health service, 
American Red Cross, Washington, D. C. 

Richard A. Bolt, M. P., general director, American Child Hygiene Association, 
Baltimore, Md. 

E. Dana Caulkins, manager, National Physical Education Service, Washington, 
D. C. 

Taliaferro Clark, medical officer in charge of field investigations in child hygiene, 
United States Public Health Service, Washington, I). C. 

Edward N. Clopper, assistant secretary, National Child Labor Committee, New York 
N. Y. 

Anna E. Rude, M. 1)., director, division of hygiene, Children's Bureau, United 
States Department of Labor, Washington, D. C. 

Willard S. Small, Ph. P., specialist in school hygiene, Bureau of Education, Depart- 
ment of the Interior, Washington, D. C. 

FOBEWOED. 

Inasmuch as health is of paramount importance to child life and as 
it has often received minor consideration in State children's codes, the 
Committee on Health Provisions for Laws Relating to Children wishes 
to emphasize the need for adequate treatment of this subject by all 
States. The following are points which should be borne in mind. 



